📕 subnode [[@KGBicheno/the ubuntu training corpus]]
in 📚 node [[the-ubuntu-training-corpus]]
📓
garden/KGBicheno/Artificial Intelligence/Brook's Speech Function/The Ubuntu training corpus.md by @KGBicheno
The Ubuntu training corpus
From [[Main Library - Chatterbot]]
The Ubuntu training corpus is an enormous (3gb) collection of conversational text from Ubuntus tech-support system.
It's heavily biased and hopelessly garbage but should allow for a decent starting point.
Locations
Structure
Usage
Outputs
Benchmarks
Notes
See [[Why I didn't use Chatterbot]]
📖 stoas
- public document at doc.anagora.org/the-ubuntu-training-corpus
- video call at meet.jit.si/the-ubuntu-training-corpus